Revision and Co-revision in Wikipedia: Detecting Clusters of Interest
Abstract
The online encyclopedia Wikipedia gives rise to a multitude of network structures, such as the citation network of its pages or the coauthorship network of its users. In this paper we analyze another network, one that arises from the fact that Wikipedia articles undergo perpetual editing. The edit volume of Wikipedia pages varies strongly over time, often triggered by news events related to their content. Furthermore, some pages show remarkably parallel behavior in their edit variance; in that case we add a co-revision link connecting them. The goal of this paper is to assess the meaningfulness of the co-revision network. Specific tasks are to understand the influence of normalization (e.g., correlation vs. covariance) and to determine how the co-revision network differs from other relations on Wikipedia pages, such as similarity by author overlap.
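The co-revision idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the time binning, the thresholds, the use of Jaccard similarity for author overlap, and all names (`co_revision_links`, `author_overlap`, the toy pages) are assumptions made purely for illustration. The sketch links two pages when the similarity of their per-period edit counts exceeds a threshold, and shows how the choice between covariance and correlation changes the resulting links.

```python
from itertools import combinations
from statistics import mean


def covariance(x, y):
    """Population covariance of two equal-length edit-count series."""
    mx, my = mean(x), mean(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)


def correlation(x, y):
    """Pearson correlation; returns 0.0 if either series is constant."""
    sx = covariance(x, x) ** 0.5
    sy = covariance(y, y) ** 0.5
    if sx == 0 or sy == 0:
        return 0.0
    return covariance(x, y) / (sx * sy)


def co_revision_links(edit_counts, similarity, threshold):
    """Link every pair of pages whose similarity reaches the threshold.

    The threshold rule is an assumption of this sketch.
    """
    links = []
    for p, q in combinations(sorted(edit_counts), 2):
        s = similarity(edit_counts[p], edit_counts[q])
        if s >= threshold:
            links.append((p, q, s))
    return links


def author_overlap(authors_a, authors_b):
    """Jaccard similarity of two author sets (one possible overlap measure)."""
    union = authors_a | authors_b
    return len(authors_a & authors_b) / len(union) if union else 0.0


# Hypothetical toy data: edits per time period for four pages.
edit_counts = {
    "PageA": [1, 2, 10, 3, 1],          # small page with one burst
    "PageB": [10, 20, 100, 30, 10],     # same shape, ten times the volume
    "PageC": [5, 5, 4, 5, 6],           # nearly flat
    "PageD": [100, 100, 300, 300, 200], # high volume, different shape
}

corr_links = co_revision_links(edit_counts, correlation, threshold=0.9)
cov_links = co_revision_links(edit_counts, covariance, threshold=50)

print([(p, q) for p, q, _ in corr_links])
print([(p, q) for p, q, _ in cov_links])
```

In this toy data, correlation links only the two pages with the same burst shape, whereas covariance also links the high-volume page to both of them, because unnormalized covariance grows with edit volume. This is one way to see why the choice of normalization matters.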
Similar resources
Predicting Edit Locations on Wikipedia using Revision History
There has been increasing interest in the machine learning community in automatic task design. In a collaborative problem-solving setting, how can we best break up and assign tasks so as to optimize output? Huang et al., for example, considered the problem of effectively assigning image-labeling tasks to Amazon Mechanical Turkers [1]. In the realm of Wikipedia prediction, Cosley et al. created ...
Using Language Models to Detect Wikipedia Vandalism
This paper explores a statistical language modeling approach for detecting Wikipedia vandalism. Wikipedia is a popular and influential collaborative information system. The collaborative nature of authoring, as well as the high visibility of its content, have exposed Wikipedia articles to vandalism, defined as malicious editing intended to compromise the integrity of the content of articles. Ex...
Measuring Contextual Fitness Using Error Contexts Extracted from the Wikipedia Revision History
We evaluate measures of contextual fitness on the task of detecting real-word spelling errors. For that purpose, we extract naturally occurring errors and their contexts from the Wikipedia revision history. We show that such natural errors are better suited for evaluation than the previously used artificially created errors. In particular, the precision of statistical methods has been largely o...
The Effect of Multi-step Oral-revision Processes on Iranian EFL Learners’ Argumentative Writing Achievement
The purpose of this study was to explore the role of two multi-step oral-revision processes as feedback providing tools on Iranian EFL learners’ argumentative writing achievement. The participants taking part in this study were 45 Iranian EFL students who were randomly assigned into three groups. The participants of the groups were given three argumentative writing assignments, each assignment ...
Detecting Wikipedia Vandalism using WikiTrust
WikiTrust is a reputation system for Wikipedia authors and content. WikiTrust computes three main quantities: edit quality, author reputation, and content reputation. The edit quality measures how well each edit, that is, each change introduced in a revision, is preserved in subsequent revisions. Authors who perform good quality edits gain reputation, and text which is revised by several high-r...